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BEMARKS 



Claims M, 7-19, and 21-38 are pending in the application. Claims 1, 19. 23. 
24, and 25 are currently amended. 

The Examiner is thanked for granting the interview on August 8, 2007. 

In the Interview the issue of the use of prior knowledge about L structure of 
the document being analyzed was raised (as described in the disclosure in numerous 
references to "ioMigeat structure analysis"). In light of the interview, Applicants 
now add clarifying language to the independent claims to indicate the use of said prior 
knowledge during the analyzing. 

In addition, the issue of the internal publication format being m a mark-up 
language (as previously claimed in dependent claim 2. now cancelled), such as XML 
was discussed. In light of the interview, Applicants now add the language "said 
mtemal publication format comprising a mark-up language" to all independent claims. 

Claim Rg^ectiy ns - fTsr 

In this section of the Office Action, claims 1-4. 7-19, and 21-38 were rejected 
under 35 USC 103(a) as being unpatentable over Fenrel et ai., US006199082B1 in 
view of Cooperman US006377704B1 . 

Favorable reconsideration of this rejection is respectfully requested since as 
will be shown below, the above-amended and previously presented claims are both 
novel and inventive over the prior art cited by the Examiner. 

The present disclosure teaches a system that automatically carries out 
MRent stfuofure analy sis of a scan or like existing passive digital verdon of an 
existing document, The scan, or like existing passive digital version, preserves the 
layout of the document, but has no further structural or other information or metadata 
to differentiate any portion of the document from any other portion. While the passive 
digital version provides an image of the page layout for the human eye it has no 
meaning for computer programs. 

Within the passive digital version of an existing document, no object is 
distinguishable from another - indeed, no object is identifiable as an object per se. 
Rather, the passive digital version is a collection of undifferentiated bits and bytes. 
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Not only are objects within the existing passive document not recognized 
neither are any relationships that may exist among them since, as noted, the objects 
themselves are not identifiable as objects. 

^° ^""^'^"'^'•^^^''""''■^"'^™inn,li,,n ilnni m to permit 

any sort of consideration or operation on meaningful subsets of the document (where 
meanmgful subset" is another name for an "object" or "objects" as described in the 
application). 

Examples of desirable consideration and operations are 

• search and retrieval of specific objects in the documents; 

• content-sensitive and structural navigation; 

• re-organizaUon of the document's componem parts; 

• re-publishing all or a subset of the documem in a dilTerem presentation 

format; 

• le-publishing all or a subset of the documem on a different platfonn; 

• identification of relationships between subset objects, such as an article 
and its accompanying illustration, headings etc.. or part A of an article 
with part B of the same article; 

. idenUfication of of objects for additional, obiect-snenfir 

ETQcessiiig. such as. but not limited to. object-specific compression. 
OCR, and semantic analysis; 

• identification of relationships to other documents. 

An aspect of the present invention is to infuse meaning and structure to an 
existmg document, where such meaning and structure is lacking prior to the operation 
of an embodiment of the invention. 

nie steps to achieve an enhanced documem augmented with metadata flilly 
describing the components of the document and the relationships among them (and 
back to the original passive document) are described in the application. This is no 
merg trangPopnation of d iir% but involves sophisticated analysis of the passive 
beyond basic sep m entaUon into rrrfanp ler or OCR of textual regions. 

The results of the sophisticated intelligent analysis are stored in a repository 
that includes various transformations of the original passive document (e.g., cropped 
and at different resolutions) as well as the metadata for each component derived by 
the analysis. The metadata inco^jorates simple information, such as references to the 
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original font and size (for a text object) as well as intricate det... ..j.^., 
propeyt,^ a^ id relafionshi M to oth er objec t , as well as relationshin. to th. fi,„ 
page from which tt was extracted 

A product of the present invention is the use of the data stored in the 
i^positoo. to republish the original document's contents in new and innovative ways 
For example, an embodiment of the system can take a very large collection of 
archived newspapers scanned from where they were previously stored on microfiche 
and automatically provide them as a collection of indexed, searchable, stand-alone 
arfcles. mcluding their associated graphic illustrations and links between related 
articles. No such ability is taught in the prior art. 

TTie foregoing is only one example of the benefits of the present invention 
Add,tioaal benefits are described in the application and others will also be evident to 
one skilled in the art. 



Prior art cited by the Examiner 

The Applicants respectfully thank the Examiner for the new citation of the 
prior art taught by Cooperman US006377704B1 - filed 4-30-1998 (hereinafter 
Coopemian) and welcome the opportunity to demonstrate how the present invention is 
both novel and inventive over the teachings of Cooperman and other prior art cited by 
the Examiner, namely Fen-el, et al US006377704B1 - filed 07-17-1995 (hereinafter 
Ferrel), 

Examiner has rejected all claims as being unpatentable due to obviousness by 
Ferrel in light of Coopennan. The appUcants respectfully traverse and submit that 
Examiner has not made out ^pnn,a facie case of obviousness as demonstrated in the 
detailed discussions below. 

Additionally, Examiner has stated "Using the broadest reasonable 
interpretation, the Examiner equates the claim^ converting data from one format 
to a different format as equivalent to a set of predefined styles along with supporting 

macros/' 

Applicants respectfully refer to claim 1 wherein is claimed converting each 
object to an internal puWication format and to claim 19 wherein is claimed a 
mark-up language distiller module for converting the newspaper from said 
Original format to a mark-up language format and a publisher senrer for 
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object (0 an imanal publicaUoa fonaat. 

have cta^ed oaly U,e ^ ^ ^ 

(along .d<«,i„^ i„ ^ ^^^^ 

F™i no, Coopenmn «.ohes nor even hinte a. id« of oo„v«Ung daufeute 
vi nOM, ^u,^ .-.ro ^^d in ,h, „ ^^^^ ^ ^ ^ 

up language fnrmaf 

In light Of tho Exan,in«.s appar« mi^eading of Appli«„t». cidms 
AppllWMs «i,pe«ft||y withdrawal otlha a»li,y of the action. 

Fetrel discloses a m„,a„,edia publishing system for creating online 
public^om. wh.«i„ a. contents and design aspects of the online pubUshing process 
are sepa,«ed into distinct components before being t^^itted to the c«s.«„c^ 
oomputers. Separation of a,e content, and .he design aspects of the online 

P»bl.ca„o„s facilitates efficient authoring of doc„n,an,s by ind^endent con.«,t. 

«ea«>rs and design.crea.„„ as well as efficient distdbution of published documents 

^ enaWng the transmission of, «,r e«mple, high-<,uali.y tiUes over low-speed 

communicaUons links subject to loss of connectivity. 

Coopen^n, teaches an innovation i„ the «aln. of OCR. specifically 
.toiyng insets in a scanned doc>unen. so as to facilitate detennining ^ 
readmg order and creating, fcr example, an editable approximation of the original 



As will be shown in .he detailed discussions below, to present invenUon is 
bod, novel and inventive over the .cachings of Ferrel a«l Coope™»n, as well as the 
combination hereof, should such a combination be considered likely by on. .,ai„«, i„ 
the art of either of the fields to which said teachings relate. 
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The Examine, h« ^ one of ordi^ry skill i„ fte « would have 

con^Mcd *e .eaehing. of Fen., ^„ coop=™» Tl» AppKcane 
«, ».ve«e. Fetel ...che. . f„ p„,„^„g , 

a«tar,„g new, »i^„al where 4^1^ element a:. »u.o«d and managed 

d.«.ncCy fton, content: -V.. ^ ^ ^^^^^ 

cle«, .eparauon of ocnten, and design. In Ms contex, con,«,. is defined as U» actual 
da^ fta. ,s to be displayed to the user. design of a project is how Uut information 
dtspUyed ,0 .he user (e g., its format on the computer screen)." (Ferrd col 8 
tas 4 9). Thrs is reinforced b, FerKPs „hserv«ion that MP system supports' 
s^ such as Microsoft Won, and Standard Generali^ Maricup La,^,, 
SGML) to ensure that the content p^vider's investment in aiah^ can ^ g,,,, 
everaged. The MP system abo jsgfe standard HyperText Markup Lanpage (HTML) 
documents so that existing mui doannents can be easily converted to more 
»ph>s<.c«ed applications. AdditiotuUy, U„„gh standard tools 

supports OLE server capabilities can be used ,„ c^ate c«,tent embedded'in an 
MPS title. By suppotting additional mSsi_Sk^!m!. the MPS can also 
accommodate other tools (for example m^Mmmmimm." (Fen.1 col. 
«, hne 66 to col. 6} line 10). emphasis added). Fetnrs "existing tools" are ones for 
coating new content. Tie Ferrel sys,«n "™ds", i.e. imports mul and can 

accommodate "standard file formafc" Mn«« ♦v 

me tonnats . None pf tjiese sntteest the .wp »f ir^^.m^rnr 

stnictuf^ analysis on nre-evi^mf, ^^n-nrnt" 

Cooperman, by contrast, teaches a minor improvement to OCR systems 
namely mset detection to improve detennination of correct reading order and 
producing editable documents in correct reading order (see above). Tt. teachings of 
Coopenuan are not related to the field of modem publishing systems, but rather to the 
conversion of pre-existing documents to machine-i^adable fom.. e.g., for archival 
purposes. 

One trained in flre ar, of archival analysis is not typically also involved in the 
fie d of modem pubUshing systems and vice versa, mus. the teachings of Coopenn^ 
and Ferrel would not ordinarily be combined. 

ta any case. «, shown above and further shown in the d«ailed discussion of 
the cldms below, fl» t^, „f j^^^, ^ ^^^^ lactiflyenSvej^ 
m^^vative teac Mni T, of p,^ff.n, app i i^K,, . „^ i„„„,,^ ^ 

teachings are lacking from the combinaUon as well, 
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As one specific example, both Feirel, and Coopennan. and the combination 
thereof, lack automatically analyziag the newspaper to decompose the 
predetermined layout of each page of the newspaper m the original, existing 
format into said plurality of blocks, each block representing an object; said 
analyring furthermore identifying at least one logical n^btionship between said 
blocks; converttag each object to an internal pubUcaHon format, said internal 
publication format identifymg and preserving s«d Internal structure of said 
blocks within said objects, said intemal publication fonoat furthermore 
preserving said layout as a relationship between said objects; said internal 
publication format furthermore preserving said logical relationship; and 
rendering said internal publication format to incorporate said objects, said 
layout and respective internal structures u,d said logical relationship in the final 
publication format. 

The aforementioned lack in the cited prior art, as well as addiUonal new and 
innovative teachings of the pi^sent invention are described in detail in the secUons 
following. 

Claim 1, as cunently amended, defines a method for automatically publishing 
data in a fmaj publication format, wherein the data is in the form of a newspaper 
having an original, existing format including a plurality of pages, each page having a 
predetermined layout comprising a plurality of independently standing data blocks, 
each block having an internal structure, the method comprising: obtaining a scamied 
representation of said newspaper, said representation preserving said layout, from said 
representation automaticaUy analyzing the newspaper to decompose die 
predetermined layout of each page of the newspaper in the original, existing format 
into said plurality of blocks, each block representing an object; said analyzing 
fiirthermore applying prior knowledge of newspaper structure and identifying at least 
one logical relationship between said blocks; converting each object fo an internal 
publication foimat, said internal publication format identifying and preserving said 
internal structiue of said blocks within said objects, said internal publication format 
forthermore preserving said layout as a relationship between said objects; said internal 
publication format furthermore preserving said logical relationship; and said internal 
publication format comprising a mark-up language to indicate said objects and 
features of said internal structure and rendering said internal publication format to 
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incon^orate said objects, said layout and respective internal stmctures and said logical 
relationship in the final publication format. 

Regarding independent eJaio. 1. Examiner has stated (pg. 2) that "Ferrel 
teaches from said representation automatically analyzing the newspaper to 
decompose the predetermined layout of each page of the newspaper in the 
onginal, existing format into said, phirality of blocks, each block representing an 
object; converting each obj«:t to an internal publication format, said internal 
publication format idenrifying and preserving said internal structure of said 
blocks within said objects, said internal publication format furthermore 
preserving said layout as a relationsWp between said objects. ... Using the 
broadest reasonable interpretation, the Examiner equates the claimed internal 
publication fonnat as equivalent to a set of predefined styles along with supporting 
macros." 

Applicants respectfiilly traverse. Ferrel teaches a system for authoring and 
publishing data. The Fen.1 system separates the design and layout definitions from 
the content definition: "One unique concept that peimeates the MP system is the clean 
separation of content and design. In this context, content is defined as the actual data 
that is to be displayed to the user. The design of a project is how that information gets 
^^splayed to the user (e.g., its format on the computer screen)." (FerreJ col 8, lines 4- 

This separation faciUtates authoring bv .„ti,nr. ^ho h.v. .^p^.,-.. 
design independently of authorinR bv authon. who i,... ,,,, ^,^„„,„ 

(Fenel col. 63, lines 13-16). The separation also facilitates publication where style 
and layout infommtion may be communicated to a remote device one time, but used 
many times for varying and updated content, thus omimizin. .... nf ^....^^ 
eommunicatinn linir. (Feirel col. 62, lines 53-65). 

Ferrel does not teach, nor even hint at, from said representation 
automatically ss^^ the newspaper to decompose the predetermined layout 
of each page of the newspaper in the original, existing format into said plurality 
of blocks, each block representing an object. The Ferrel nnhiu.=.p 
upon the authors ^o ^^ly^jMm elements «nH .^.-^^nihipT if ,.„j 

between elements, whereas the present invention perfonns intelligent and automatic' 
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analysis to feiXE cW and «l.teh>p, b«w«„ element. (For tf» 
a-scrasion retal«i to to t«Khi„g, of C«open«an, ple.« ^ below.) 

op««.oos ,h««.p„„. ,.E., „f a,i3 ^ ^ 

. conversion of those objects oenainly does not appear. 

'° "StaofA^ Examiner's con»,e«s in the interview, Applicants have added 
d^rfyutg iangnnge to the claim regaling .pp„i., „fo,>n.„.^^ „,.^ .„^^ 

and «..«frtog ,e,.. o,, r*t..„hip between ,..d bloclL and 

««1 inter..! puWicdon fom... f.„b.,„.., ^ ^ 

and re«l.ring ,aid iWer,.! p.bKe.ti.n fertn., ta..nK.r.l. ... ..id lodal 

r.i.li,..*ip. This is described in the present application in several plaees (eg 
"XML distaier module 18 preferably perfonns iaSlUggjs stmcton, analysis, in orde^ 
to be able to reoogmze and define the strucn«s and objects contained in the 
newspaper data" [0036]; "Recognition of fte for newspaper objects is an 

pan of the process" [0037]; and "the present i„v«,tio„ does not merely 
convert the data fi^m one format to a different formal but instead is able to also 
piovide at least a basic .^mict.»> c,, ^•rm'-il\r Ihr rtitn" [0024j. 

F«.el does not teach, nor even hint at said intemri -hH,.«„., 
H fm-^K ^nd pre^TvimS snid intennl stmetar. of «M bloeto within said 
objeets, said internal p.bll«ti.n f,n,., Ii.rth.rm,,, HSISEStog said hv«.t a, . 
«l.t.onsbip b«w..n »dd objects; said inl.m.1 pubUcti., fom,., to,fl,erm.„ 
preserving said logical rehUonshlp. The ,tm«,nx and „l.,ionsl,ip idenUfri,, 
and presen.!,, here clamed relate to sdd object, derived in the analyzing. Feel 
does not teach, nor even hint at, said analyxing and eettainly not Idenlifring ™, 
preserving. 

In addition, in light of the Examiner's comments during the interview 
Apphcant ha. inco,porated the language of dependent claim 2 (now canceled) to limii 
claun 1 to a method with ,aid internal pubUcation format comprising , mark-up 
•anguage to indicate said object, and features of said internal struetur. . As a 
fimher clarification, Applicant respectfully points out that XML (as claimed in 
dependent claim 3) is but one example of mark-up languages suitable for the pun,ose 
Other markup languages currenUy available that may be appropriate include HTML 
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SGML, XSL, and others. It is understood that one trained in the art may also use other 
markup languages including ones specifically crated for the purpose of the method 
descnbed in claim 1. This is discussed in Applicants' original disclosure in numerous 
places including, inter alia, "Repository 20 preferably stores these infonaation objects 
m two different data formats: a suitable Web page and/or document mark-up language 
format, such as XML for example" [0045]; «A document mark-up language such as 
XML .s particularly preferred for storing this type of information about the entity " 
[0047]; "AS a non-limiting example, the operation of the present invention ^vith regart 
to z-order may be optionally performed by using a mark-up language such as HTML 
for example." [077]; 

Applicants respectfully submit, therefore, that the claimed internal 
publicatian format is distinct from the teachings of Fenel, whose "set of predefmed 
styles along with supporting macros" (Ferrel col. 26, lines 42-43) do not include the 
clauned relationships, blocks and objects, nor an internal pubHcaHon for«,at 
comprising a mark-up language. 

In the Ferrel system, all content data is newlLcreated by authors exercising 
creativity to create new content. In the presently discussed claim 1 is claimed 
Sonverting each object to an internal publication fonnat. Such converting is not 
taught nor hinted at by Ferrel since in the Ferrel system the data is created by authors 
m a publication format from the outset. As Ferrel states: 'The MPS authoring 
environment provides a way for authors to cre»t. .tn....... ...,^,,,.» ..^^ 

create content for the MP system 100 in the MPS Document Editor 188. an author 
SISS^LAiocujsem based on the MPS template." (Ferrel col 16. lines 19-42) 
Furthermore, Ferrel teaches conversion from one publication fonnat (e.g., Microsoft 
Word, SGML. HTML, OLE (Ferrel col 62 line 66 to col 63 line 10)) to another 
publication fonnat, but not converting each object derived from gaid representation 
to an internal publication format. 

Examiner has also stated (pg. 4) that "Ferrel teaches rendering said internal 
publication format to incorporate said objects said layout and respective internal 
structures in the final publication format" citing F^Vs t^chings about style 
sheets, styles, markup languages, and various output devices. 

Applicants respectfully traverse. As shown above, the claimed internal 
pubhcation fomat is distinct from the teachings of Ferrel and the claimed rendering 
therefrom must also be distinct. 
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Exammer has fl^er stated (pg. 4) that "Ferrel does not expressly teach but 
Cooperman teaches obtaining , scanned representation of said newspaper, said 
representation preserving said layout." Applicants respectfully traverse. 

Coopeiman provides a cogent statement of a problem faced in the art of OCR 
(col. 1, lines 43-49): 

Document layout analysis is a process by which the information 
regarding the organization of the documem content, i.e. its structure is 
extracted from the document image. The stmcture identifies document entity 
types (e.e. paragraphs, figuies and tables), their properties (e.g. the number of 
columns in a table), and their interrelations (e.g., a figure is above a caption). 

Cooperman continues and explains that the an of OCR is concerned with this 
problem specifically to solve the problem of producing a document in correct reading 
order. As Cooperman states (col. 1 , lines 57-61) (emphasis added): 

Indeed, such information is important to state-of-the-art OCR systems 
that attempt to produce textual output in "readinaorder" (e.g.. appropriately 
reflecung the flow of text in columns, headers, footers, captions and now 

insete). 



As Cooperman makes clear, the problem addressed by Cooperman's 
disclosure and invention is specifically in the realm of producing a document in 
"r^admg order", and, more specifically, to addressing the problem of insets. 

Cooperman later explains that the general problem of documem layout 
analysis is not solved in the art (col. 6, lines 14-26) (emphasis added): 

For each region or segment identified within a particular page of the 
document, the following infomution is preferably provided by a segmentation 
system, (not shown): 

1) A description of each region as either text, graphic (e.g.. picture, drawing, 
etc.), ruling, timestamp, or table cell; 

2) m coordinates of the region's bounding box in absolute distance from 
either the scanned edge or the hard copy document's edge; and 
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3) For text regions, font information comprising: (a) the most common font 
(b) the font size, (c) the font style (e.g., regular, bold, italic, fixed pitch,' 
etc.), and (d) the number of lines of text within the region. 

As Coopennan states, the low-level identification of ..gions as text, graphic 
etc.. the coordinates thereof, and partial ancillary infonnation related to the fonts does 
not cpp stinite "dor„mr nt hvo,.t analvs.V "se^.nt:,t,-.„ >■ r^Z^ 

Cooperman's disclosure continues by disclosing a contribution to solving a small pari 

of the document layout analysis prohl. .^ when stating (col. 6 lines 27-36) (emphasis 
added); 

In lesponse to the above inputs and in accordance with the present 
'"venuon, a tomssUayMjnal^ witkinset detection c.p .K.-n^. 
preferably provides the following outputs: 

1 ) An identification of insets within the document; 

2) A classification of the inset type (e.g.. fhime, credit, center, column or 
stray; examples of frame, center and column depicted in FIGS, 6-9); 

3) The reading order of text regions; 

4) A grouping of text regions into sections. 

In addition to clarifying the distinction between mere gsgmentatios and 
tomteutanaUsis, Cooperman's disclosure makes clear that the contribution 
to the art therein is one of identifying insets and establishing correct reading order As 
stated, this is "...to produce the output of the document layout analysis system- 
.nformation sufficient to recreate a close app«>ximation of an original documem ss 
that the mfonnarion mByJ^pioyed to prnH„.. . fi,^ ^, 

proce,sgrfon,, t76r jnFrGO),h,„., ^ ^ approximation to the 

appearance of the original document." (col. 6, lines 44-50) (emphasis added) 

Furthermore, it is clear that when Coopeiman states that a product of the 
process is a fJe that «W a close approximation to the appearance of the original 
document" there isjlointeaLtocom^^ 

^''^ ^« inp uts M ontpnts »re in sn ffi cler,t to Ho rh. only intent is to 
achieve an "appffiximatign (eg., "the most common font" and not ail font 
information), and not an exact replica with added features. 
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Significantly, the stated goal of Cooperman's embodiments is " to produce a 
formatted fife.. .that is editable ," The Coopennan disclosure does not teach nor even 
hint at creating a fiilly decomposed and annot ated fi e, marlc ed-uo) intentipdiiifft 
rgpTesentaUon suitable for re-publication with interactive features in both the original 
form and arbitrary and multiple new fonns, as claimed in the currently amended 
claims. Coopeiman also does not teach nor even hint at re-publication with interactive 
figaJures as enabled by said analyziog furthermore applyiog prior knowledge of 
newspaper structure aiid identiQring at least one logical relationship between 
said blocks and internal publication format furthermore preserving said logical 
relationship and said internal publication format comprising a mark-up 
language to indicate said objects and features of said internal stmcture in either 
the original form or arbitrary and multiple new forms, as claimed in the currenUy 
amended claims. 

As described in the application, embodiments of the present invention provide 
re-publication of the ori ginal document with interactive features in both the exact 
original form and arbitrary and multiple new forms based upon a fallv decomposed 
and annotated ri.e. marked-up) inte rmediate representatior^ . Furthermore, the 
intelligent layout analysis employed by embodiments of the present invention results 
in identification of high-level iopic al objects in the original document (e.g., an 
"article, an advertisement, a picture which is not otherwise associated with an article 
or advertisement, and general data, which covers information that does not fit into one 
of the preceding categories. Examples of general data include, but are not limited to, 
crossword puzzles, television guide listings, stock quotes and so forth." [0036]. Other 
types of identified and classified objects mentioned (but not limited to these) in the 
application include: Tities, subtiUes, Roof Utle, Abstract, Bylines, Lead, Continuation. 
Captions, Graphic elements (lines, frames, decorations). Sections, Subsections. 

The intelligent layout analysis also goes further in analyzing objects and 
decomposes them into primitives, which are also stored in the repository ("A 
primitive is a rectangular region of a printed newspaper page, which contains textual 
and/or graphical information. Articles and other entities on the printed newspaper 
page are optionally and preferably represented as a "puzzle", or struchired collection, 
of the rectangular primitives from which the printed page is composed." [0047] 
"Primitives preferably comprise lines, words, and quads (which together form broken 
or multiple-style words). [0048]" 
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Another feature of the present invention is that it analyzes the elements so as 
to identify each element and how each el^.nt. is logically r.i.fed to all nth^r 
Slemsnts. For example, it may be detemiined that a certain text belongs to a certain 
picture. A new interactive version of the document, or subset of the document, can be 
published that allows one to click on the picture and get the related text. Coopennan 
does not teach, nor even hint at. automaticaHy analyzing the newspaper to 
decompose the predetermined layout of each page of the newspaper ia the 
original, existing format into said plnraUty of blocks, each block representing an 
object; said analyzing furthermore applying prior knowledge of newspaper 
structure and identifying at least one logical relationship between said blocks; 
converting each object to an internal publication format, said internal 
publication format identifying and preserving said internal structure of said 
blocks within said objects, said internal pubUcation format furthermore 
preserving said layout as a relationship between said objects; said internal 
publication format furthermore preserving said logical relationship; and said 
internal publication format comprising a mark-up language to indicate said 
objects and features of said internal structure and rendering said internal 
publication format to incorporate said objects, said layout and respective 
internal structures and said logical relationship in tiie fual publication format. 

This is discussed in the present application, specifically at [0025], wherein it is 
stated that "the structure of the published content uses the interactive capabilities of 
the Web page to their fullest extent, by enabling the user to view different portions of 
the content in any desired order, with links between thps e nfirtinn. hpinp rr..A. 
apg^yding to the information in the content anrf/n r according to the tvne of cnntPnt 
rather than according to an arbitrary structure imposed according to the requirements 
of the printed newspaper" and at [0029] "This process results in a set of enhanced. 
structured files which co mbine the original image of the H.t. optionally in the TIF 
fomiat as previously described, but preferably in the GIF and/or JPG formats, with the 
t^XtaqdPir.ipformf itio n . Each such file thus preferably maintains the visual aspects 
of the newspaper layout, while enabling far greater functinn;.^f ^y to be avaiUble 
through the Web page version of the newspaper" and [0036] "Next. XML disHUer 
module 18 preferably performs intelligent stmcture an^ly^i. in order to be able to 
recognize and define the structures and objects contained in the newspaper data, 
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particularly vrith regard to each page of the newspaper" and [0046} wherein is stated 
that for the entiUes stored in the repository "the mark-up language format, such as 
XML, preferably contains the actual content of the newspaper object, as v^ll as 
detail, pf th e lnte ^ ,^ ^n.m.r. nhd lel^ tjonshipfs) h.t,ve.^. its variou. ..».p...... » 

(emphasis added) 

The automatic recognition of rekUonships also includes structure relations 
such as that stated in (0037]: "Recognition of the hierarchy for newspaper objects is 
an important part of the process. One illustrative, non-limiting example of such a 
hierarchy for recognized newspaper objects is the following- 

Newspaper^Section^Page^Subsection-^Article-^Photograph^Caption^Photogr 

apher (by name)." 

In summary, as clearly illustrated by the above examples, embodiments of the 
present invention perfonn, inter alia, intelligenutmct^^ providing the 
iMaficati on of hi«h-level o bjec ts and primitive, in the original document as well as 
numerous and complex relationships between the objects, which is not taught in the 
cited prior art. 



With regard to Coopemian specifically. Coopennan does not teach, nor even 
iunt at. said analyzing furthermore identifying logical structures and 
relationship, between blocks and objects as claimed in the cuirenUy amended 



claim 



TTius. even if the teachings of Ferrel were combined with those of Cooperman, 
one would not obtain the method of the current claim. 

It is thus believed that claim 1 is both novel and inventive over the prior art 
and respectfully maintained that the claim sAould be allowed. 

Claim 19. as currently amended, defines a system for automatically publishing 
received data of a pre-existing newspaper, the newspaper in a computerized fonnat, the 
system comprising: (a) at least one source of newspaper data, said source preserving an 
ongmal. structure of the newspaper, the computerized fomiat comprising a plurality of pages 
each page having a predetermined layout comprising a plurality of independenUy standing 
data blocks, each block having an internal structure; (b) a mark-up language distiller module 
for converting the newspaper fo,m said original format to a mark-up language fonnat 
wherem said mark-up language distiller module applies prior knowledge of ne^^pape^ 
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T' "^'^"^ -^'^ ^-^ ~ 

*3UHe, m«,„,e a™^, ^^„^ ^^^^ ^ J 

R=ga,di»e indn,„d«. .Wo, I,. E«mi„er h« ,ta«d (pg. 5) to "Fe«l 
.c«h« . „„k..p dW., ^^^^ ^ 

».d oris.., to . „„k.„p b,^^. 

Uag««. da«n.r m«l„,e .„,.„.H.rt,y ^ 

.rigi..!, ..bd-g digta. f.™« d.„„p.„ „^,^^,^ 

pyty bl«ta, «c<, block „M, .«d toun... „p ^ 

»d^d«, d.U .b,«., «.b ,bj.c, b.vi.g ,.„, 

d.U, »cb «.b ,bi«, i. to m.*., h»gu.g. ft,™., 

markup h,M u.g. fo™.. .„.,^,j ^^^^„ 

(P8. 6) "U«„g rte bK»dea r««onabte int«p«tatio„, d» Examiner equates the 
*»,ed a markup ,i,ai.r „d ta«™. „p^„. „ 

aidH,«,d«rt dau ,bi«, aa equivalem ,« . «. of p,«l.f„ed «y.ea along ^ft 

supporting raacros." 

Applicant reapeotMl, ttaverse. A, diacus^d above „g.rf ,„ ^ai™ 1 
F«re|-, syaen, is „« „|,e« aurora create „e„ oonten, penel does „„. ttach ^ 
hi„, at, . ,„ .„to«.««,H , pabUshtag i„, . ^^^^ 

.^.spaper. „„ does Fe„.l teach, „„r even M„, .,, ^ ^ac^"^ 
.^t..n,a..llr a-^lyr,, fte o.„,p.p., d.^ ..y .^^.i .^^^^ 
f.n.a.l!uUEsmj2s. a,. „e„p.p„ 

y .Id ta,.„., a,n..,„e ,.pr.s«,H.g i^.p„,.„, 

Iw'tag contest and at least one adribm, of tbe data. 

b light of the Examiner's comments. Applicants have added clarifying 
language to the claim t^garding ho« a m.*.p tangnage distiller „odn,. .ppB„ 
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pnor koowledge of newspaper sin.ct„re and aulooiatically analyzes the 
newspaper dato In said origin.,, exisl«.g digital format to ... idemify logical 
-^Uonships between blocks and objects, said markup language distiUer module 
ftirther analyzing and preserving said logical relationship,. tTus is described in 
the present application in several places (e.g., "XML distiller module 18 preferably 
perfonns intelligent structure analysis, in order to be able to recognize and define the 
stnactures and objects contained in the newspaper data" [0036], "Recognition of the 
hierarchy for newspaper objects is an important part of the process" [0037]- and 'the 
present invention does not merely convert the data from one fonnat to a different 
!r|;oT"' is able to also provide at least a basic structure for organi..g the 

Applicant, have also taken advantage of this opportunity to edit the earlier 
language in the claim to clarify that it is the said markup tanguage distiller module 
fiirther analyzing and preserving said structure. 

Ferrel's "set of predefined styles along with supporting macios" are provided 
by a "template" used by an author to create a new document (Ferrel col 26, lines 42- 
43). The claimed internal structure is used to represe„t(ingj an independent data 
object derived as a result of the process where said mark-up language distiller 

''PP"^^ PPor knrMvled^. of n.^,p ,p .^eomaticallv 

the newspaper data in said original, existing digit., format to 
decoffifiose the newspaper data into said plurality of Mocks, each block with said 
.nte^al structure representing an independent data object, each object having 
content and at least one attribute of the data, such that each object is conveited 
to said markup huguage format, said markup l«.g„age distiller module further 
anaftrzing and preserving said structure" Fcrrel does not teach, nor even hint at. a 
language distiller module that has prior knowledge of newspaper structure and 
autopaHcally analyzes a preexisting newspaper to decompose it into blocks and 
structurts and identify at least one logical relationship between blocks. 

The claimed internal structure preserves the repmentation and structures 
«id relationships derived from the analysis. No such capability is taught by Ferrel 
nor, as explained in detail above in the discussion of claim 1, by Cooperman. and 
certainly not in the combination. 

It is thus believed that claim 19 is both novel and inventive over the prior 
art and respectfully maintained that the claim should be allowed. 
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publ«ta„g to of a p««ds.i«e doc™™, i„ a fi„., p„wi^^ 

daa « received ■„ a» origW, «d«i„g fo™„ oompming a pWty „f p.g« ea^, 

page havi,^ a prede.e™i„=d ,.,ou. co„,pri»,« a plotaiity of i„dep.nd»,ly s«Ldi„g 

<lamM«cta,eachb)oclhavtaga.i«ema,„„„,^a„„,^^„„^.^j^^ 

a rcp««„Mon of >h. peexisd.^ docomen, fte repr«e„,aUo„ preserving a,. 

tayoo.; from fte represenwion, aWomatoliy andyzi,^ fte preexisU,^, ,„ 

decompose *e data received in d.e «ie,™,, exi«i„g f„„„„ , p,^;^ ^^.^ 

each ob,«. com«pa,di„g to «,e of Ae blocl«; aaid a.«,yei„g ' 

pnorla.ow.edgeofdocumen.s.n.c,urea»dide«iiyi„ga,lea«o„elogical«ia«onsh^^ 
be«.e«, said blocks; prepari.^ a lis. of »x, and/or graphic element for each object- 
dettnnmmg properties of each element, inciudi,^ determining visibility and overlap 
"sties for each graphic element within Ute object; ..cognizing ,.mcn.al 
kcom properties of the data in an original fonna.; converting each obj«=, to an 
m.em.1 pubUcatioo form.,; said internal public^ion foma. fi«hen„„„ f^^^^ 
said logical relaUonship; and said internal p„WicaU„n fc™, eompri,,^ , 
language ,o indicate said objects and fe«„res of said tatemal sttuonre and re«lering 
the mtcnal publicaUon fonnat in the final publicaUon fonna^ the final publicaUon 
tot pteseoBng fte blocks as fte independently standing ohjects incorporating the 
.ntemal staictnre and Uie layoa, and said logical reUttionship. 

As explained in dettil above whh regard to claim I, neither Ferrel m,r 
Coopemian teaches the intelligent structure analysis required for .o,.„,«e.,,y 
"..lyjing tb. pr.«dsB„g d..„„,., ^ ^ ^ ^ 

on^ exUBng f.n„., h,. . p,„^^ „„y^ e,r,«p.,dtog (o 

... of «.id block, said ^..lyztog fcr.h.,m„„ applytag prior knowledge of 
d«:,un.« ,.r».h.r. and ld«,ttytag ., ™. ^ ^^^^ 
Wocksi and it is certainly not taught in the combinaUon. 

Examiner has stated (pg. 9) that "Fe™! teaches prqaring a lis, of .«« 
«.*.r graphic element for ead. object. Specifically, Ferrel disclos«> the metftod 
of detemuning properties of each element, includmg detSBiaiagviaaia (as laugh, 
by Fenell a, col. 26, lines 1 5-25), and overlap char«*,istics for each graphic element 
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within said Object (as taught by Fenell at col. 36, line 25-67, and at col. 38, lines 30- 



Fem=rs tt^hings «l«e lo an "Autotag Subsysten.- (Fe^l, col. 26, U« 
13). An authonng system is »m, „ed b, hun,«, .„fl,„s to create n«, c««™, As 
F™I continue, (col. 26. line 14-25), "Content is separ«ed from design in ,!« MP 
syst^n 100. 1, a„ Vie«e, 202 (FIG. 2), contem and design are bought together by 
controls to display a tiUe as specified by ...e designer. As a resul^hese controls need 
.0 .fatify different Cenren,, ^ ^ ^ 

co.«eUy, This is done by ereaSng structmed content. He MPS authoring 
environment ptoyide, . ^ f„ jujiisaJP ere«e stn.cn,n^ ^ ^Ps 

authonng environment includes the MPS Document Editor 1«8, «hich suppom the 
oreuron of stnrctured documents, insertionof links and the appUc.ti„„ of properties to 
these documents for content retrieval." 

While Feitel's described "authoring enviK,nment provides a way for authors 
^^-^m^^^MsUocmm", claim 23 is for a method for .utooiaHcny 
fiablisfeiagdjsia of, pree,i,ting document in a final publication format, wherein 
"''^^ived in an orirtnal. i«i,tinp f..^^. comprising a plurality of 
pages, each page having a predetermined layout. Fenel does not teach, nor even 
hint at au^p,atjcailYpuhlishinM^>t> of a preexistin.. document. 

Furthermore, Ferrel's teaching "these controls need to identify different 
elements in the stmcture of the content so they may format it cortecUy" has to do with 
the Identification of individual elements as being of one type or another so that «they 
may fbnnat it correctly" and does not teach nor even hint at compiling or in any way 
preparing a list of text and/or graphic elements for each object. 

It is thus believed that claim 23 is both novel and mventive over the prior 
art and respectfully maintained that the claim should be allowed. 

Claim 24, as cuirently amended, defines a method for automatically 
pubhshing received data in a final publication fommt. the data having an original 
existing format comprising a plurality of pages, each page having a predetermined 
layout comprising a plurality of independenUy standing data blocks, the method 
compnsmg: obtaining a representation of the received data having an original existing 
fom^at, the representation preserving the layout; from the representation 
automatically analyzing the pages to decompose the pages of the data received in the 
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Original, existing fonnat into a plurality of objects, the objects corresponding to the 
blocks; said analyzing furthennore applying prior knowledge of page sm^cture and 
•dentifymg at bast one logical relationship between said blocks; preparing a list of 
text and/or graphic elements for each object; detenuining properties of each element 
including detennining a special characteristic for each text element; recognizing 
stmctur^ layout properties of the data in an original for^t; convening each object to 
an internal publication format; said internal publication format identifying and 
preservmg said internal structu,. of said blocks within said objects, said internal 
publication format furthermore preserving said layout as a relationship between said 
objects; said internal publication format furthermore presenting said logical 
relationship; and said internal publication fonnat comprising a mark-up language to 
indicate said objects and features of said internal structure and rendering the intenuj 
publication format in the final publication format such as to include the recognized 
structure in the objects and said logical relationship and to include the layout. 

As explained above in detail with regard to claim 1, neither Feirel nor 
Cooperman teaches the intelligent structu.. analysis required for automatically 
aaaliaag the pages UL^ecoinfifige th. pages of the data received in the original 
existing fonnat into a plnraUty of objects, said objecl. corresponding to said' 
blocks; said analyzing furthermore applying prior knowledge of page structure 
and identifying at least one logical relationship between said blocks; prepariog a 
hst of text and/or graphic elements for each object; determining n^n.wi,. „r 
Sachjd^t. including determining a special characteristic for each text 
i:££g B'^"g 'Structural lavo„. .r.^ ^^ of the data in an original 
fonnat nor the use of an internal publication fonnat comprising a mark-up 
language and these are certainly not taught in the combination. 

Other details of the Examiners objections have been addressed above as weU. 
It is thus believed that claim 24 is both novel and inventive over the prior 
art and respectfuUy maintained that the claim should be allowed. 

Claim 25 defines a method for automatically publishing received data in a 
final publicadon format, wherein the data is in the fomi of a newspaper the 
newspaper having an original, existing format comprising a plurality of pages each 
page having a predetermined layout comprising a plurality of independently standing 
data blocks, each block having stnictural layout properties, the method comprising- 
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owning a representa^ „f ^ 

<tompo« «,e d«a revived i„ *c odgi,»,, exisdng f„m„, i„.„ . p.^^^ ^ 

iu«h=rm«.„,yi,,,pri<,, knowledge of newspaper s»u«„„ and identi^ng „ 
o« log-cal relati^ship b«w«. .aid block, preparing a «« of and/or gn^Wo 
^emen« for each object; detennining proper^e, of each elemen, ,ccogni^„g ^ 
^ucmral byou, p«per«es of datt in an original fom«„ deten,nni„g ..oh «« 
»gn.em for each objec, building . Uo^ fe„ . ^.^^ ^ 

se^enta; convening each object to an intt^al pnblicadon fo™,a,; said inBa»l 
pubheata for™.. fi«herino« p,e«,vi„g said iogicd reMonship; and add internal 
publ^auon fomm comprising a mark-up language u> indicate aaid objects and 
fatoes of aaid internal stn.c.„,e and re«leri„g said internal publicaUon fonn« in the 
fin., publication fonnat to comprise said blocks as objects i„c„rpon«ing said 
•wctural layout properties, and to include said layout and said logical relationship. 

As expl^ned .hove in det«l with regard to cWm 1, neither Fetrel nor 
Cooperman teaches the MigauaauEJMtofa .pplying prior laowfedge of 
««.sp.p„ ..racre required for .u..m.«e.l,. ^^^^^ ^ 

S^mim dM. ,«d,ed i. ttte .Hginal, «drt.g for».t In., . „, 
.b,««, sdd objees ,o„«p.«dlng .0 said todepaidenfly s.aBdi„g bk«k„ „d 
«M analysing fartbenoor. .ppiytag prjo, un«wMg, ,f „.w,p.p., 
and ld.n«(,tog feast one logical relaMonship brtw«,n said bl«*s, preparitMs . 
l»t of tex. and/or grapW. aan«u> for e.di objert, delem.!..... -^ p,^„ 
end. de««i„ tecgi^biBp ^ ,^^„, ,.,„„, ^ 

.rig«.l form.,, de.«.mlntog each ta. «igm«,. for ...h obje..; baildlng a to,, 
block tron, . pinndiQ. of .lig..«, to, ^ ^ 

in.en..l p.blie.«.. t^, „„ u„ „f ^ ^^^^^^ ^^^^ 

f«r*erm,re preservUig said togicl ,eh,tt,.ship, «d nid internal p.bll..«,n 
forma, comprising . mark-up Jangnag, |, indicate said object, and feature, of 
said i..«i,al «rueftire and these ate cenainly not aught in the combination. 

Other detals of the Examiners objections have been addressed .bove m well 
It is thus believed ttiat claim 25 is both novel and inventive over the prior 
an and respectfully maintained that the claim should be allowed. 
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behoved ,„ b.^,„„aM. „ ^ „„ ^ ^ 

All Of the matte raised by fte E^i^ have bee„ deal, »im and a:e 
believed to have been overcome. » ana are 

In «-ew of ±e foregotag. j, ^^^^ ^ 

pending u, the application are allo««bie. 

An «.rly Notice of AUo»a„ce is therefore respectfhlly re-pres^d. 
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